Extraction of Definitional Contexts using Lexical Relations
نویسندگان
چکیده
In this paper we present a method for automatically extracting definitional contexts from restricted domains in Spanish. Definitional contexts are textual fragments where there is an implicit definition that can be identified by taking into account verbal patterns linking a term and its corresponding definition. Our interest is in definitional contexts with analytical definitions. Therefore, we focus on the extraction of textual fragments with a term and a hypernym. Then, hypernym is used for filtering non-relevant contexts by means of the occurrence frequency. It is assumed most frequent hypernyms have a higher probability of giving true definitional contexts than those less frequent hypernyms. We captured regularity in analytical definitions by means of a chunk grammar. This method achieves acceptable results in precision and recall compared with other works. General Terms Automatic Extraction of Conceptual Information.
منابع مشابه
Recognition and extraction of definitional contexts in Spanish for sketching a lexical network
In this paper we propose a method to exploit analytical definitions extracted from Spanish corpora, in order to build a lexical network based on the hyponymy/hyperonymy, part/whole and attribution relations. Our method considers the following steps: (a) the recognition and extraction of definitional contexts from specialized documents, (b) the identification of analytical definitions on these d...
متن کاملDesign of a Extraction System for Definitional Contexts from Biomedical Corpora
In this paper we show a general advance about the desgin of a methodology for extracting definitional contexts from corpus of biomedicine in Spanish, taking into account a set of processes performed by the following modules: (i) a term extractor based in a hybrid method, (ii) a set of verbs that configure the syntactic structure of a definitional context, (iii) a chunker able to recognize those...
متن کاملECODE: A Definition Extraction System
Terminological work aims to identify knowledge about terms in specialised texts in order to compile dictionaries, glossaries or ontologies. Searching for definitions about the terms that terminographers intend to define is therefore an essential task. This search can be done in specialised corpus, where they usually appear in definitional contexts, i.e. text fragments where an author explicitly...
متن کاملExpanding HiErarcHical contExts for constructing a sEmantic Word nEtWork
A semantic word network is a network that represents the semantic relations between individual words or their lexical senses. This paper proposes Watlink, an unsupervised method for inducing a semantic word network (SWN) by constructing and expanding the hierarchical contexts using both the available dictionary resources and distributional semantics’ methods for is-a relations. It has three ste...
متن کاملرویکردی با ناظر در استخراج واژگان کلیدی اسناد فارسی با استفاده از زنجیرههای لغوی
Keywords are the main focal points of interest within a text, which intends to represent the principal concepts outlined in the document. Determining the keywords using traditional methods is a time consuming process and requires specialized knowledge of the subject. For the purposes of indexing the vast expanse of electronic documents, it is important to automate the keyword extraction task. S...
متن کامل